Spatial Hearing Algorithms Based on Binaural Zero-Crossings: Sound Source Localization, Segregation, and Dereverberation
Not recorded
Abstract
This thesis presents a new zero-crossing-based binaural model for spatial hearing. The conventional binaural model computes cross-correlations of binaural signals to estimate the interaural time difference (ITD), a primary spatial cue. However, the cross-correlation-based binaural processing model is computationally expensive and localizes sound sources inaccurately, especially in noisy multisource environments. The proposed model extracts two important binaural cues, the ITD and the interaural intensity difference (IID), from the zero-crossing times and interval powers of the filtered signals. This fundamental difference in binaural cue extraction gives great flexibility in designing spatial hearing algorithms. Another distinctive feature of our model is that it estimates the signal-to-noise ratios (SNRs) of the filtered signals from the variances of the ITD samples, which enables noise-robust estimation of ITDs using the estimated SNRs. Using the zero-crossing-based binaural model, we developed three novel spatial hearing algorithms: localization, segregation, and dereverberation.
• Localization: Multiple sound source directions are localized in noisy environments on the histogram of ITD samples weighted by the estimated SNRs. In experiments on noisy multisource environments, the proposed localization algorithm estimated sound source directions more accurately and more robustly to noise than the conventional cross-correlation-based method.
• Segregation: Using the locations of the sound sources, we assigned each zero-crossing interval power to one of the sound sources to estimate the target-to-interferers power
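The idea of reading ITDs from zero-crossing times and locating a source on a histogram of ITD samples can be illustrated with a minimal sketch. It is not the thesis implementation: the function names, the sign convention (right-ear time minus left-ear time), the nearest-crossing pairing rule, the 1 ms ITD limit, and the bin count are all assumptions made for illustration. The histogram is left unweighted because the abstract does not give the formula that maps ITD variance to an SNR weight.

```python
import numpy as np

def upward_zero_crossings(x, fs):
    """Times (in seconds) of upward zero-crossings, linearly interpolated."""
    x = np.asarray(x, dtype=float)
    idx = np.where((x[:-1] < 0) & (x[1:] >= 0))[0]
    # Fraction of a sample between x[idx] (< 0) and x[idx + 1] (>= 0)
    frac = -x[idx] / (x[idx + 1] - x[idx])
    return (idx + frac) / fs

def itd_samples(left, right, fs, max_itd=1e-3):
    """One ITD sample per left-ear crossing (assumed pairing: nearest
    right-ear crossing). Positive values mean the right ear lags."""
    tl = upward_zero_crossings(left, fs)
    tr = upward_zero_crossings(right, fs)
    if tl.size == 0 or tr.size == 0:
        return np.array([])
    diff = tr[None, :] - tl[:, None]           # all pairwise time differences
    nearest = diff[np.arange(tl.size), np.argmin(np.abs(diff), axis=1)]
    return nearest[np.abs(nearest) <= max_itd]  # drop implausible pairs

def itd_histogram_peak(samples, max_itd=1e-3, n_bins=41):
    """Centre of the most populated ITD bin (unweighted histogram)."""
    edges = np.linspace(-max_itd, max_itd, n_bins + 1)
    counts, _ = np.histogram(samples, bins=edges)
    k = int(np.argmax(counts))
    return 0.5 * (edges[k] + edges[k + 1])

# Usage: a 500 Hz tone that reaches the right ear 0.25 ms late
fs = 48000
t = np.arange(int(0.1 * fs)) / fs
left = np.sin(2 * np.pi * 500 * t)
right = np.sin(2 * np.pi * 500 * (t - 0.25e-3))
samples = itd_samples(left, right, fs)
print(np.median(samples), itd_histogram_peak(samples))
```

In this sketch a single narrowband tone stands in for one filter-bank channel. With several sources, each would be expected to contribute its own cluster of ITD samples, which is why a histogram rather than a single average is used.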
Similar resources
Sound segregation based on binaural zero-crossings
This paper presents a new method of sound segregation based on zero-crossings generated from binaural filter-bank outputs. In our approach, sound source directions are identified using the spatial cues such as inter-aural time differences (ITDs) and inter-aural intensity differences (IIDs). The estimation of ITDs is performed using zero-crossings generated from binaural filter-bank outputs to g...
Zero-Crossing Based Time-Frequency Masking for Sound Segregation
This paper presents a new method of zero-crossing based binaural mask estimation for sound segregation under the condition that multiple sound sources are present simultaneously. The masking is determined by the estimated sound source directions using the spatial cues such as inter-aural time differences (ITDs) and inter-aural intensity differences (IIDs). In the suggested method, the estimatio...
Zero-crossing-based ratio masking for sound segregation
This paper presents a new method of zero-crossing based binaural mask estimation for sound segregation under the condition that multiple sound sources are present simultaneously. The masking is determined by the estimated sound source directions using the spatial cues such as inter-aural time differences (ITDs) and inter-aural intensity differences (IIDs). In the suggested method, the estimatio...
Integrating Monaural and Binaural Cues for Sound Localization and Segregation in Reverberant Environments
The problem of segregating a sound source of interest from an acoustic background has been extensively studied due to applications in hearing prostheses, robust speech/speaker recognition and audio information retrieval. Computational auditory scene analysis (CASA) approaches the segregation problem by utilizing grouping cues involved in the perceptual organization of sound by human listeners. ...
Validity and reliability of the Persian version of spatial hearing questionnaire
Background: Our ability to hear in space is critical for understanding speech in noisy environments and for localization. The Spatial Hearing Questionnaire (SHQ) has been devised to focus only on spatial hearing tasks (e.g., lateralization, distance detection and binaural detection). The aim of the present study was to determine the reliability and validity of the Persian translation of the SHQ (Spatial ...